Dynamic tuning of language model score in speech recognition using a confidence measure
Abstract
Speech recognition errors limit the capability of language models to predict subsequent words correctly. One effective way to improve the language model's usefulness is to use confidence measures. Most current efforts to develop confidence measures for speech recognition apply these measures to the final recognition result. However, using them early in the search process may guide the search toward more promising paths. In this work we propose to use a word-based acoustic confidence metric, estimated from the word posterior probability, to dynamically tune the contribution of the language model score. The performance of this approach was tested on a conversational telephone speech corpus, and the results show significant reductions in recognition error rates.
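The idea in the abstract can be illustrated with a minimal sketch. The abstract does not give the actual formulas, so everything below is an assumption made for illustration: the N-best approximation of the word posterior, the linear mapping from confidence to language model scale, and the names `word_posterior`, `dynamic_lm_scale`, `combined_score`, `base_scale`, and `min_factor` are all hypothetical and are not taken from the paper.

```python
import math


def word_posterior(hyp_log_scores, contains_word):
    """Approximate a word posterior from an N-best list (illustrative assumption).

    hyp_log_scores: total log score of each hypothesis.
    contains_word:  booleans, True where the hypothesis contains the word.
    Returns the share of probability mass on hypotheses containing the word.
    """
    m = max(hyp_log_scores)  # subtract the max for numerical stability
    total = sum(math.exp(s - m) for s in hyp_log_scores)
    with_word = sum(
        math.exp(s - m) for s, c in zip(hyp_log_scores, contains_word) if c
    )
    return with_word / total


def dynamic_lm_scale(confidence, base_scale=12.0, min_factor=0.5):
    """Hypothetical mapping from confidence in [0, 1] to an LM scale.

    Low confidence shrinks the scale toward min_factor * base_scale;
    full confidence keeps base_scale. The linear form is an assumption.
    """
    confidence = min(max(confidence, 0.0), 1.0)  # clamp to [0, 1]
    return base_scale * (min_factor + (1.0 - min_factor) * confidence)


def combined_score(acoustic_logprob, lm_logprob, confidence, base_scale=12.0):
    """Log-linear combination of acoustic and confidence-scaled LM scores."""
    return acoustic_logprob + dynamic_lm_scale(confidence, base_scale) * lm_logprob


if __name__ == "__main__":
    conf = word_posterior([-10.0, -10.0, -12.0], [True, False, True])
    print(conf, combined_score(-100.0, -2.0, conf))
```

In this sketch the language model is trusted less when the acoustic confidence of the word history is low, which matches the motivation that recognition errors in the history degrade language model predictions. A real decoder would compute posteriors from a lattice during search rather than from a finished N-best list.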
Related resources
Comparison of spectral methods for spoken language identification
Automatic spoken language identification is the task of identifying a language from the speech signal. Language identification systems can be divided into two categories: spectral-based methods and phonetic-based methods. In the former, short-time characteristics of the speech spectrum are extracted as a multi-dimensional vector. The statistical model of these features is then obtained for each language. The ...
Confidence measures for dialogue management in the CU Communicator system
This paper provides improved confidence assessment for detection of word-level speech recognition errors and out-of-domain user requests using language model features. We consider a combined measure of confidence that utilizes the language model back-off sequence, language model score, and phonetic length of recognized words as indicators of speech recognition confidence. The paper investigates ...
An understanding strategy based on plausibility score in recognition history using CSR confidence measure
Although car-navigation systems attract attention as one kind of spoken dialogue interface, recognition errors due to the influence of natural speech and surrounding noise may prevent a smooth dialogue and disappoint the user. Thus, this research aims at the construction of a dialogue system which can achieve a smooth dialogue and a high degree of user satisfaction. Our system performs language und...
Semirings Modeling Confidence and Uncertainty in Speech Recognition
As usual in speech recognition, in this paper I restrict attention to part of the first meaning, namely to “the belief that one can rely on something”. To get a feeling for this concept, assume that we have an automatic speech recognition system which is configured with a language model giving Bayesian prior probabilities to certain word sequences. If a user says something to the system, the sy...
Keyword spotting for highly inflectional languages
This paper presents our new keyword spotting system, which takes advantage of both the filler model and the confidence measure approaches. The novelty lies in a non-standard connection of the filler and keyword models, together with the introduction of a new confidence measure based on a keyword normalized score. In detail, the paper deals with a decision block. Two methods are introduced. The first is b...
Publication date: 2002